Image processing apparatus, information processing apparatus, information processing method, and recording medium

ABSTRACT

An image processing apparatus captures an image, detects an object in the captured image, and transmits the image and metadata containing object information regarding the object detected from the image to an information processing apparatus. If a detection function is valid with respect to a first image to transmit and the detection function is invalid with respect to a second image to transmit after the first image, the image processing apparatus transmits invalidity information indicating that the object is not detected with respect to the second image to the information processing apparatus.

BACKGROUND OF THE INVENTION Field of the Invention

The present invention relates to a technique that analyzes a capturedimage to detect an object.

Description of the Related Art

Conventionally, there has been an imaging apparatus configured toconduct a video analysis on an image captured by the imaging apparatusand distribute information about an object detected as a result of thevideo analysis as metadata.

Further, generally, a command group for allowing a client apparatus toinstruct the imaging apparatus to change a setting and to start thedistribution of the image and the metadata is implemented on the imagingapparatus configured to distribute the captured image to the clientapparatus. For example, there has been known a command group defined bya standard formulated by Open Network Video Interface Forum (ONVIF) asan example of such a command group. ONVIF also defines a content of themetadata, and examples of this content of the metadata include objectinformation containing details of the object detected by the videoanalysis. Examples of the object information include positionalinformation indicating a position of the object detected in the image.

Further, Japanese Patent No. 5999395 discloses an imaging apparatus thatdetermines whether to distribute or not metadata based on a video typeof a distributed video image.

Then, in a case where the imaging apparatus uses a protocol that addsthe object information to the metadata only when there is a change inthe object information about the detected object, the client apparatusthat receives the metadata may continue holding the object informationreceived last time. At this time, when the detection function becomesinvalid with respect to the image that the imaging apparatus transmits,the client apparatus may continue holding the object informationreceived last time and determine that the object is continuouslydetected. In this case, the object information that the client apparatuscontinues holding is invalid information after the detection function ofthe imaging apparatus becomes invalid.

According to the technique discussed in Japanese Patent No. 5999395,even when the detection function becomes invalid on the imagingapparatus side, the client apparatus may incorrectly determine that theobject is continuously detected on the imaging apparatus side andundesirably present a display misleading a user.

SUMMARY OF THE INVENTION

To prevent the client apparatus from continuing holding the invalidobject information, according to an aspect of the present invention, animaging apparatus includes an image acquisition unit configured tocapture an image, a detection unit configured to detect an object in theimage captured by the image acquisition unit, and a transmission unitconfigured to transmit the image and metadata containing objectinformation regarding the object detected from the image by thedetection unit to an information processing apparatus. If a detectionfunction of the detection unit is valid with respect to a first imagethat the transmission unit transmits and the detection function isinvalid with respect to a second image that the transmission unittransmits after the first image, the transmission unit transmitsinvalidity information indicating that the object is not detected withrespect to the second image to the information processing apparatus.

Further features of the present invention will become apparent from thefollowing description of embodiments with reference to the attacheddrawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic view illustrating a system configuration.

FIG. 2 illustrates outlines of hardware configurations of an imagingapparatus and a client apparatus.

FIGS. 3A and 3B are functional block diagrams of the imaging apparatusand the client apparatus, respectively.

FIG. 4 illustrates a sequence of image distribution and metadatadistribution between the imaging apparatus and the client apparatus.

FIG. 5 is a schematic view of a screen displayed on a display apparatus.

FIGS. 6A and 6B are flowcharts of information addition processing andinformation update processing, respectively.

FIG. 7 illustrates an outline of an example of a description ofmetadata.

FIG. 8 is a sequence diagram regarding the image distribution and themetadata distribution between the imaging apparatus and the clientapparatus.

FIGS. 9A and 9B are flowcharts of information addition processing andinformation update processing, respectively.

FIG. 10 illustrates a sequence of the image distribution and themetadata distribution between the imaging apparatus and the clientapparatus.

FIG. 11 is a flowchart of information update processing.

DESCRIPTION OF THE EMBODIMENTS

In the following description, embodiments of the present invention willbe described with reference to the accompanying drawings. Configurationsthat will be described in the following embodiments are merely oneexample, and the present invention shall not be limited to theillustrated configurations.

In the following description, a first embodiment will be described. FIG.1 is a schematic view illustrating a system configuration according tothe present embodiment. An imaging apparatus 100 is an apparatus thatcaptures an image, and is, for example, a network camera or a monitoringcamera capable of transmitting the image via a network 103.

A client apparatus 101 is an information processing apparatus, and iscommunicably connected to the imaging apparatus 100 via the network 103.The client apparatus 101 transmits, to the imaging apparatus 100, eachcontrol command, such as a command requesting image distribution andmetadata distribution and a command for acquiring a type of an image onwhich an image analysis is conducted. The imaging apparatus 100transmits, to the client apparatus 101, a response to the commandtransmitted from the client apparatus 101, an image in response to theimage distribution request, and metadata in response to the metadatadistribution request.

A display apparatus 102 is connected to the client apparatus 101, andis, for example, a display that displays, for example, an image based onthe image data transmitted from the imaging apparatus 100.

Further, the imaging apparatus 100 and the client apparatus 101communicate with each other via the network 103. The network 103includes a plurality of routers, a plurality of switches, and/or aplurality of cables that satisfy a communication standard such asETHERNET (registered trademark). The network 103 may be constructed withuse of the Internet, a wired local area network (LAN), a wireless LAN, awide area network (WAN), or the like.

Next, hardware configurations of the imaging apparatus 100 and theclient apparatus 101 will be described with reference to FIG. 2. FIG. 2illustrates outlines of the hardware configurations of the imagingapparatus 100 and the client apparatus 101.

First, the hardware configuration of the imaging apparatus 100 will bedescribed. A central processing unit (CPU) 200 comprehensively controlsthe imaging apparatus 100.

An interface (I/F) 201 is in charge of the communication with the clientapparatus 101 according to a protocol such as Open Network VideoInterface Forum (ONVIF) via the network 103. The I/F 201, for example,transmits the image and the metadata to the client apparatus 101, andreceives, for example, various kinds of commands for controlling theimaging apparatus 100 from the client apparatus 101.

A random access memory (RAM) 202 temporarily stores therein a computerprogram that the CPU 200 executes. Further, the RAM 202 temporarilystores therein data acquired from outside via the I/F 201 (for example,the command, and a parameter regarding an imaging condition). Further,the RAM 202 provides a work area used when the CPU 200 performs, forexample, flowcharts illustrated in FIGS. 6A and 9A and sequencesillustrated in FIGS. 4, 8, and 10, which will be described below.Further, the RAM 202, for example, functions as a frame memory andfunctions as a buffer memory.

A read only memory (ROM) 203 stores therein, for example, a program usedfor the CPU 200 to control the imaging apparatus 100. The imagingapparatus 100 may be configured to include a secondary storage devicesimilar to a hard disk drive (HDD) 213, which will be described below,in addition to the ROM 203.

An imaging device 204 includes a lens group and an image sensor thatform an imaging optical system. The lens group includes an optical lensfor forming an image from incident light and collects the incident lighton the image sensor. The image sensor includes a charge coupled device(CCD) sensor, a complementary metal oxide semiconductor (CMOS) sensor,or the like. Then, the imaging device 204 generates an electric signalby photoelectrically converting a subject image that is formed on theimage sensor by being transmitted through the lens group of the imagingapparatus 100. Further, the imaging device 204 generates image data byperforming processing for converting the generated electric signal intoa digital signal and encoding processing based on a format such as JointPhotographic Experts Group (JPEG), H.264, and H.265.

Next, the hardware configuration of the client apparatus 101 will bedescribed. A CPU 210 controls the client apparatus 101.

An I/F 211 is in charge of the communication with the imaging apparatus100 according to the protocol such as ONVIF via the network 103.Further, the I/F 211, for example, receives the image transmitted fromthe imaging apparatus 100, and transmits, for example, the various kindsof commands for controlling the imaging apparatus 100.

A RAM 212 temporarily stores therein a computer program that the CPU 210executes. Further, the RAM 212 temporarily stores therein, for example,data acquired from outside via the I/F 211 (the command and the imagedata). Further, the RAM 212 provides a work area used when the CPU 210performs, for example, flowcharts illustrated in FIGS. 6B, 9B, and 11and the sequences illustrated in FIGS. 4, 8, and 10, which will bedescribed below. Further, the RAM 212, for example, functions as a framememory and functions as a buffer memory.

The CPU 210 executes a computer program stored in the RAM 212. Aprocessor such as a digital signal processor (DSP) may also be usedbesides the CPU.

The HDD 213 stores therein a program of an operating system and theimage data. The computer program and the data for performing theflowcharts and the sequences that will be described below are stored inthe HDD 213 loaded into the RAM 212 as necessary according to thecontrol by the CPU 210 and executed by the CPU 210. Other storage media,such as a flash memory, may also be used besides the HDD.

A casing of the client apparatus 101 may be prepared integrally with thedisplay apparatus 102. A user interface (UI) 214 is an input device,such as a keyboard and a mouse.

In the following description, the present embodiment will be describedreferring to an example in which the CPU 210 performs the flowcharts andthe sequences that will be described below, but may be configured insuch a manner that at least a part of the processing of the CPU 210 isperformed by dedicated hardware. For example, processing for displayinga graphical user interface (GUI) and the image data on the displayapparatus 102 may be performed by a graphics processing unit (GPU).Further, processing for reading out a program code from the HDD 213 todevelop it into the RAM 212 may be performed by a direct memory access(DMA) functioning as a transfer device.

Next, the imaging apparatus 100 according to the present embodiment willbe described with reference to FIG. 3A. FIG. 3A is a diagramillustrating a functional block of the imaging apparatus 100. In FIG.3A, the imaging apparatus 100 includes an image acquisition unit 300, acontrol unit 301, a detection unit 302, a communication unit 303, and astorage unit 304. In the following description, each of the functionalblocks included in the imaging apparatus 100 illustrated in FIG. 3A isrealized by the CPU 200 executing the program stored in the ROM 203.

The image acquisition unit 300 acquires the image data generated by theimaging device 204.

The control unit 301 controls overall processing of the imagingapparatus 100 according to the processing performed by the CPU 200.

The detection unit 302 performs processing for detecting an object inthe image acquired by the image acquisition unit 300. Object informationabout the object detected by the detection unit 302 is transmitted tothe communication unit 303 and/or the storage unit 304.

The detection unit 302 according to the present embodiment carries outpattern matching using a collation pattern (a dictionary), and detects ahuman body in the image. The detection unit 302 may employs anothermethod as long as having a function of detecting a certain object set asa target that a user wants to detect from the image. The processingperformed by the detection unit 302 may be known moving objectdetection, skin color detection, or the like, and is not limited to thepattern matching processing. Further, in the following description, thedetection unit 302 according to the present embodiment will be describedassuming that the human body is set as the target for the detection, butthe target for the detection may be, for example, a human face, avehicle, or an animal.

The communication unit 303 receives each control command from the clientapparatus 101 via the I/F 201. Further, the communication unit 303, forexample, transmits the image and the metadata to the client apparatus101 based on each control command received from the client apparatus 101and transmits an event notification to the client apparatus 101 via theI/F 201.

Further, the communication unit 303 according to the present embodimentperforms the following processing. That is, if the detection function ofthe detection unit 302 is valid with respect to a first image that thecommunication unit 303 transmits and the detection function is invalidwith respect to a second image that the communication unit 303 transmitsafter the first image, the communication unit 303 transmits invalidityinformation indicating that the object is not detected with respect tothe second image.

The invalidity information according to the present embodiment ismetadata indicating that the object is not detected. Further, thedetection function according to the present embodiment means thefunction of detecting the object by the detection unit 302. Further, inthe present embodiment, the detection function is determined to be validif an image type of a distribution image that the communication unit 303transmits and an image type of an image on which the detection unit 302performs the processing for detecting the object match each other, andis determined to be invalid if these image types do not match eachother. This is because, even if the detection function is exerted withrespect to an image little relevant to the distribution image, adetection result thereof is little worthwhile for the user.

The storage unit 304 stores therein a setting value indicating aninstallation state of the camera, such as installation on a ceiling andinstallation on a wall, a setting value of the image type of thedistribution image, a setting value of the image type of the image onwhich the video analysis is conducted, the image data acquired by theimage acquisition unit 300, and the like. The image type in the presentexample refers to a type of the image generated by the imaging device204, and examples thereof include a fisheye image captured by a fisheyecamera and a panoramic image generated by carrying out an imagecorrection and an image cutout on the fisheye image.

Further, in the present embodiment, the video analysis will be describedassuming that it is the processing for detecting the object from theimage by the detection unit 302.

Next, the client apparatus 101 according to the present embodiment willbe described with reference to FIG. 3B. FIG. 3B is a functional blockdiagram of the client apparatus 101. In FIG. 3B, the client apparatus101 includes a control unit 330, a communication unit 331, a displayunit 332, and a storage unit 333. In the following description, each ofthe functional blocks included in the client apparatus 101 like theexamples illustrated in FIG. 3B is executed by the CPU 210.

The control unit 330 controls overall processing of the client apparatus101 according to the processing performed by the CPU 210.

The communication unit 331 transmits, to the imaging apparatus 100 viathe I/F 211, each control command, such as the command requesting theimage distribution and the metadata distribution and the command foracquiring the image type of the image on which the video analysis isconducted. Further, the communication unit 331 receives the response toeach control command that is transmitted from the imaging apparatus 100,the captured image and the metadata, the event notification, and thelike from the imaging apparatus 100 via the I/F 211.

The display unit 332 causes the display apparatus 102 to output thereonthe image received via the communication unit 331. Further, the displayunit 332 causes the display apparatus 102 to output thereon informationbased on the object information about the object detected by thedetection unit 302 of the imaging apparatus 100. The display unit 332according to the present embodiment causes the display apparatus 102 tooutput thereon a bounding rectangle indicating the position of theobject in the image while superimposing the bounding rectangle on thereceived image, based on the object information about the objectdetected by the detection unit 302.

The storage unit 333 is used as an area storing therein various kinds ofdata, such as the image data of the image captured by the imagingapparatus 100 and the image type of the image on which the videoanalysis is conducted by the imaging apparatus 100.

Next, a sequence in which the imaging apparatus 100 provides the imagedistribution and the metadata distribution to the client apparatus 101will be described with reference to FIG. 4. FIG. 4 illustrates thesequence in which the image distribution and the metadata distributionare provided from the imaging apparatus 100 to the client apparatus 101according to the present embodiment.

The control unit 330 of the client apparatus 101 controls thecommunication unit 331 to cause it to transmit a command requesting theimage distribution and the metadata distribution in which the type ofthe image is specified so as to receive the image distribution and themetadata distribution from the imaging apparatus 100.

The communication unit 303 of the imaging apparatus 100 receives thecommand requesting the image distribution and the metadata distribution.

The control unit 301 of the imaging apparatus 100 generates thedistribution image according to the image type specified by the clientapparatus 101 and the metadata by performing information additionprocessing 400 for adding the object information to the metadata, basedon the received command. The information addition processing 400according to the present embodiment will be described in detail belowwith reference to FIGS. 6A and 7.

The control unit 301 of the imaging apparatus 100 controls thecommunication unit 303 to cause it to transmit the generated image andmetadata to the client apparatus 101. Then, the communication unit 331of the client apparatus 101 receives the transmitted image and metadata.If the client apparatus 101 receives metadata containing the objectinformation or metadata indicating that the object is not detected, theclient apparatus 101 performs information update processing 401. Theinformation update processing 401 according to the present embodimentwill be described in detail below with reference to FIG. 6B.

A protocol of the command for use in the communication between theclient apparatus 101 and the imaging apparatus 100 illustrated in FIG. 4may be ONVIF or may be a control protocol specific to the monitoringcamera that is prepared by each manufacturer.

Next, processing performed by the display unit 332 of the clientapparatus 101 will be described with reference to FIG. 5. FIG. 5illustrates one example of a viewer screen. A viewer screen 500 in FIG.5 is displayed on the display apparatus 102 by the display unit 332 ofthe client apparatus 101. The display unit 332 causes the displayapparatus 102 to present a display thereon while superimposing abounding rectangle 502 of the object information contained in thereceived metadata on the image 501 received from the imaging apparatus100.

Next, the information addition processing 400 according to the presentembodiment will be described with reference to FIGS. 6A and 7. FIG. 6Ais a flowchart illustrating a flow of the processing for adding theobject information or the information about no detection of the objectwhich indicates that the object is not detected to the metadata. Theprocessing according to the flowchart illustrated in FIG. 6A will bedescribed assuming that it is performed by each of the functional blocksillustrated in FIG. 3A that are realized by the CPU 200 executing thecomputer program stored in the ROM 203 of the imaging apparatus 100.

In step S601, the control unit 301 acquires information indicatingwhether the detection function of the detection unit 302 is valid orinvalid. At this time, the control unit 301 according to the presentembodiment determines that the detection function is valid if the imagetype of the distribution image that the communication unit 303 transmitsand the image type of the image on which the detection unit 302 performsthe processing for detecting the object match each other, and determinesthat the detection function is invalid if these image types do not matcheach other. For example, the control unit 301 determines that thedetection function of the detection unit 302 is invalid with respect tothe image that the communication unit 303 transmits if the distributionimage that the communication unit 303 transmits is the fisheye image andthe image on which the detection unit 302 performs the processing fordetecting the object is the panoramic image.

To achieve this determination, the control unit 301 according to thepresent embodiment acquires the image type of the distribution image andthe image type of the image on which the video analysis is conducted. Atthis time, the control unit 301 according to the present embodimentacquires the setting value of the image type of the distribution imageand the setting value indicating the installation state of the camerafrom the storage unit 304. Then, the control unit 301 determines thatthe image type of the image on which the video analysis is conducted isthe fisheye image if the installation state of the imaging apparatus 100is the installation on a ceiling, and determines that the image type ofthe image on which the video analysis is conducted is the panoramicimage if the installation state of the camera is the installation on awall. In this manner, the control unit 301 according to the presentembodiment acquires the image type of the image on which the videoanalysis is conducted based on the installation state of the imagingapparatus 100.

The control unit 301 may be configured to acquire the setting value ofthe image type of the distribution image and the setting value of theimage type on which the video analysis is conducted from the storageunit 304 in the case where the setting value of the image type on whichthe video analysis is conducted is stored in the storage unit 304.

Then, the control unit 301 according to the present embodiment acquiresthe information about the detection function by determining whether thedetection function of the detection unit 302 is valid or invalid withrespect to the image that the communication unit 303 transmits based onthe acquired image type of the distribution image and image type onwhich the video analysis is conducted.

In step S602, the storage unit 304 stores an image identification (ID)indicating a transmission order of the image that the communication unit303 transmits, and the information indicating whether the detectionfunction of the detection unit 302 is valid or invalid that isdetermined by the control unit 301 with respect to this image, inassociation with each other. For example, the control unit 301 assigns“1” as the image ID to an image that is transmitted first via thecommunication unit 303, and assigns “2” as the image ID to an image thatis transmitted second. Then, the storage unit 304 stores therein theimage ID and the information about the detection function with respectto the image corresponding to the image ID in association with eachother.

In step S603, the control unit 301 determines whether the detectionfunction of the detection unit 302 is valid or not with respect to theimage that the communication unit 303 transmits. If the detectionfunction is valid (YES in step S603), the processing proceeds to stepS604. If the detection function is invalid (NO in step S603), theprocessing proceeds to step S606.

If there is a change in the object information distributed in themetadata distribution last time in step S604 (YES in step S604), theprocessing proceeds to step S605. If there is no change in step S604 (NOin step S604), the processing ends.

Next, in step S605, the control unit 301 of the imaging apparatus 100adds the object information about the object detected by the detectionunit 302 to the metadata. Then, the processing ends.

In step S606, the control unit 301 determines whether the detectionfunction of the detection unit 302 has been valid or invalid withrespect to the image transmitted last time based on the image IDindicating the transmission order of the image and the informationindicating whether the transmission function is valid or invalid withrespect to this image that are stored in the storage unit 304. At thistime, if the detection function has been valid with respect to the imagetransmitted by the communication unit 303 last time (YES in step S606),the processing proceeds to step S607. If the detection function has beeninvalid (NO in step S606), the processing ends. Further, the processingends in step S604 if there is no image transmitted last time, i.e., theinformation addition processing 400 is performed for the first timeafter the request for the image distribution and the metadatadistribution is transmitted from the client apparatus 101 to the imagingapparatus 100.

In step S607, the control unit 301 adds the information indicating thatthe detection unit 302 does not detect the object to the metadata. Then,the processing ends.

The metadata generated by the above-described information additionprocessing 400 is transmitted to the client apparatus 101.

Next, the information update processing 401 performed by the clientapparatus 101 illustrated in FIG. 6B will be described. The clientapparatus 101 performs the information update processing 401 uponreceipt of the metadata containing the object information or themetadata indicating that the object is not detected.

If the metadata does not contain the information indicating that theobject is not detected in step S608 (NO in step S608), the processingproceeds to step S609. If the metadata contains the informationindicating that the object is not detected in step S608 (YES in stepS608), the processing proceeds to step S611.

In step S609, the control unit 330 of the client apparatus 101 updatesthe object information with the object information contained in themetadata received from the imaging apparatus 100. Then, the processingproceeds to step S610.

In step S610, the display unit 332 displays the object informationupdated in step S610 on the display apparatus 102 while superimposing iton the distribution image. Then, the processing ends. In the presentembodiment, the display unit 332 causes the display apparatus 102 todisplay thereon the bounding rectangle indicating the position of thedetected object while superimposing it on the distribution image basedon the updated object information.

In step S611, the control unit 330 of the client apparatus 101 updatesthe object information with the information indicating that the objectis not detected.

If the object information is not superimposed on the distribution imagedisplayed on the display apparatus 102 in step S612 (NO in step S612),the processing ends. If the object information is superimposed on thedistribution image in step S612 (YES in step S612), the processingproceeds to step S613.

In step S613, the display unit 332 deletes the bounding rectangle basedon the object information that is superimposed on the distribution imagedisplayed on the display apparatus 102.

If the image type of the distribution image and the image type on whichthe video analysis is conducted do not match each other, the objectinformation may be undesirably superimposed at an invalid position whenbeing superimposed on the distribution image on the client side. Forexample, suppose that the imaging apparatus 100 according to the presentembodiment is capable of capturing the fisheye image with use of thefisheye lens and is further capable of generating the panoramic image bycarrying out the distortion correction and the image cutout on thefisheye image.

At this time, if the distribution image is the fisheye image and thevideo analysis is conducted on the panoramic image, the fisheye image isdistributed but the object information about the object detected on thepanoramic image is provided as the object information distributed in themetadata distribution. As a result, the object information detected inthe panoramic image is superimposed on the fisheye image. This meansthat the bounding rectangle is superimposed at a position irrelevant tothe content of the fisheye image, and therefore the detection functionis no longer valid (becomes invalid) with respect to the imagetransmitted to the client apparatus 101.

Further, besides that, for example, when the distribution image is thefisheye image and the video analysis is conducted on the image cut outfrom the fisheye image, the bounding rectangle is also superimposed at aposition irrelevant to the content of the fisheye image in a similarmanner and therefore the detection function becomes invalid with respectto the image transmitted to the client apparatus 101.

To address this inconvenience, the imaging apparatus 100 according tothe present embodiment acquires the image type on which the imagingapparatus 100 conducts the video analysis and the image type of thedistribution image. Then, if the image type of the distribution imageand the video type on which the video analysis is conducted do not matcheach other, the imaging apparatus 100 handles the image to betransmitted as one for which the detection function of the detectionunit 302 is invalid, and adds the information indicating that any objectis not detected to the metadata and transmits it to the client apparatus101. By this operation, the client apparatus 101 can be prevented fromholding the invalid object information.

In step S601, the control unit 301 according to the present embodimentacquires the information about the detection function by determiningwhether the detection function is valid or not with respect to the imagethat the communication unit 303 transmits based on the image type of thedistribution image and the image type on which the video analysis isconducted, but is not limited thereto.

For example, the control unit 301 may be configured to acquireinformation indicating whether or not the imaging apparatus 100 is setto perform the processing for detecting the object by the detection unit302 with respect to the image to transmit. In this case, if the imagingapparatus 100 is set to perform the processing for detecting the objectby the detection unit 302, the control unit 301 determines that thedetection function of the detection unit 302 is valid with respect tothe image to transmit. Then, if the imaging apparatus 100 is set not toperform the processing for detecting the object by the detection unit302, the control unit 301 determines that the detection function of thedetection unit 302 is invalid with respect to the image to transmit.

In step S602, the storage unit 304 according to the present embodimentstores therein the image ID indicating the transmission order of theimage that the communication unit 303 transmits and the informationindicating whether the detection function is valid or invalid withrespect to the image in association with each other, but is not limitedthereto. For example, the storage unit 304 may be configured to storetherein the information about the detection function with respect to theimage that the communication unit 303 has transmitted last time and theinformation about the detection function with respect to the image thatthe communication unit 303 transmits at this time.

Next, the information indicating that the object is not detected will bedescribed based on the metadata in compliance with the ONVIF standard byway of example with reference to FIG. 7. FIG. 7 illustrates an exampleof a description of the metadata in compliance with the ONVIF standard.

ONVIF defines the metadata in Extensible Markup Language (XML), and datatherein is hierarchically structured. Metadata 700 is one example of themetadata to which the object information is added in step S605. In themetadata 700, a <MetadataStream> tag indicates the entire metadata, andincludes the object information about the object, which is a result ofthe video analysis, a pan/tilt/zoom (PTZ) position of the imagingapparatus 100, an event, and the like. A <Frame> tag in <VideoAnalytics>indicates the result of the video analysis. Further, an <Object> tag inthe <Frame> tag indicates the object information about the objectdetected by the detection unit 302 in the image.

Further, a <BoundingBox> tag in the <Object> tag indicates the boundingrectangle representing the position of the object, and is expressed withuse of two points, a left and top point, and a right and bottom point onx and y coordinates in the distribution image.

Metadata 701 is one example of the metadata to which the informationindicating that the detection unit 302 does not detect the object isadded in step S607. In this case, the metadata 701 includes the <Frame>tag, which indicates the result of the video analysis, but lacks the<Object> tag in the <Frame> tag, and this indicates that any object isnot detected.

If the determination in step S604 or step S606 is NO, the metadata thatdoes not contain the <VideoAnalytics> tag is generated and transmittedto the client apparatus 101. In a case where the client apparatus 101receives the metadata but there is the object information received inthe past, the client apparatus 101 continues holding the objectinformation.

The metadata illustrated in FIG. 7 is one example according to thepresent embodiment, and is not intended to limit the format of itthereto. A format specific to ONVIF or the monitoring camera can be usedtherefor, and any format may be used as long as the employed format canexpress the metadata indicating that the object is detected and themetadata indicating that the object is not detected.

In the above-described manner, the control unit 301 according to thepresent embodiment determines whether the detection function of thedetection unit 302 is valid with respect to a first image that thecommunication unit 303 transmits and the detection function becomesinvalid with respect to a second image that the communication unit 303transmits after the first image. Then, if the detection function becomesinvalid with respect to the second image, the control unit 301 adds, tothe metadata, the information indicating that the object is not detectedwith respect to the second image, and the metadata is transmitted to theclient apparatus 101 via the communication unit 303. By this operation,the client apparatus 101 can be prevented from continuing holding theinvalid object information.

In the following description, a second embodiment will be described. Thecontrol unit 301 of the imaging apparatus 100 according to the secondembodiment controls the communication unit 303 to cause it to transmitoccurrence of a change in the detection function from the validity tothe invalidity to the client apparatus 101 as an event notification. Inthe following description, processing performed by the imaging apparatus100 and the client apparatus 101 according to the second embodiment willbe described with reference to FIGS. 8 and 9. The same or similarfunctional configuration and processing as or to the first embodimentwill be identified by the same reference numeral, and a redundantdescription thereof will be omitted as appropriate.

The communication unit 303 according to the present embodiment performsthe following processing. That is, if the detection function of thedetection unit 302 is valid with respect to the first image that thecommunication unit 303 transmits and the detection function is invalidwith respect to the second image that the communication unit 303transmits after the first image, the communication unit 303 transmitsthe invalidity information indicating that the object is not detectedwith respect to the second image. At this time, the invalidityinformation according to the present embodiment is the eventnotification indicating that the detection function is changed from thevalidity to the invalidity.

A sequence in which the client apparatus 101 receives the image and themetadata from the imaging apparatus 100 will be described with referenceto FIG. 8. FIG. 8 illustrates the sequence in which the client apparatus101 receives the image and the metadata from the imaging apparatus 100.

The control unit 330 of the client apparatus 101 controls thecommunication unit 331 to transmit the command that contains theinformation for specifying the image type of the distribution image andthe instruction requesting the image distribution and the metadatadistribution so as to receive the image distribution and the metadatadistribution from the imaging apparatus 100.

The communication unit 303 of the imaging apparatus 100 receives thecommand requesting the image distribution and the metadata distribution.Then, the control unit 301 generates the distribution image based on theimage type specified by the client apparatus 101 and the metadata byprocessing for adding the object information to the metadata accordingto information addition processing 800.

The control unit 301 controls the communication unit 303 to transmit thegenerated distribution image and metadata to the client apparatus 101.Then, the communication unit 331 of the client apparatus 101 receivesthe transmitted distribution image and metadata.

Further, in the information addition processing 800, the control unit301 of the imaging apparatus 100 determines that the detection functionof the detection unit 302 is changed from the validity to the invaliditywith respect to the image that the imaging apparatus 100 transmits, andissues the event notification indicating that the detection function ischanged if the detection function is changed from the validity to theinvalidity. At this time, the control unit 301 controls thecommunication unit 303 to issue the event notification to the clientapparatus 101.

The communication unit 331 of the client apparatus 101 receives thetransmitted event notification.

Upon receipt of the metadata containing the object information or theevent notification, the control unit 330 of the client apparatus 101performs information update processing 801.

A protocol of the command for use in the communication between theclient apparatus 101 and the imaging apparatus 100 illustrated in FIG. 8may be ONVIF or may be a control protocol specific to the monitoringcamera that is prepared by each manufacturer.

Next, the information addition processing 800 according to the presentembodiment will be described with reference to FIGS. 9A and 9B. FIG. 9Ais a flowchart illustrating a flow of the information additionprocessing 800 performed by the imaging apparatus 100 according to thepresent embodiment. FIG. 9B is a flowchart illustrating a flow of theinformation update processing 801 performed by the client apparatus 101according to the present embodiment.

The processing according to the flowchart illustrated in FIG. 9A will bedescribed assuming that it is performed by each of the functional blocksillustrated in FIG. 3A that are realized by the CPU 200 executing thecomputer program stored in the ROM 203 of the imaging apparatus 100.

Further, the processing according to the flowchart illustrated in FIG.9B will be described assuming that it is performed by each of thefunctional blocks illustrated in FIG. 3B that are realized by the CPU210 executing the computer program stored in the HDD 213 of the clientapparatus 101.

First, the information addition processing 800 illustrated in FIG. 9Awill be described.

Processing from step S601 to step S606 is similar to the firstembodiment, and therefore a description thereof will be omitted here.

In step S901, the control unit 301 according to the present embodimentissues the event notification indicating that the detection function ofthe detection unit 302 is changed from the validity to the invaliditywith respect to the image that the imaging apparatus 100 transmits.

Then, the communication unit 331 of the client apparatus 101 receivesthe event notification as described with reference to FIG. 8.

If the determination in step S604 or step S606 is NO or if thedetermination in step S606 is YES, the metadata lacking the<VideoAnalytics> tag therein is generated and transmitted to the clientapparatus 101. In a case where the client apparatus 101 receives themetadata but there is the object information received in the past, theclient apparatus 101 continues holding the object information.

Next, the information update processing 801 performed by the clientapparatus 101 illustrated in FIG. 9B will be described. The informationupdate processing 801 illustrated in FIG. 9B is performed when theclient apparatus 101 receives the metadata containing the objectinformation or the event notification from the imaging apparatus 100.

If the client apparatus 101 receives the event notification in step S902(YES in step S902), the processing proceeds to step S611. If the clientapparatus 101 does not receive the event notification in step S902 (NOin step S902), the processing proceeds to step S609.

Processing from step S609 to step S613 is similar to the firstembodiment, and therefore a description thereof will be omitted here.

In the above-described manner, the client apparatus 101 according to thesecond embodiment receives the occurrence of the change in the detectionfunction of the imaging apparatus 100 from the validity to theinvalidity as the event notification, and updates the object informationinto the information indicating that the object is not detected. By thisoperation, the client apparatus 101 that receives the distribution imageand the metadata can be prevented from holding the invalid objectinformation.

In the following description, a third embodiment will be described. Theclient apparatus 101 according to the third embodiment inquires of theimaging apparatus 100 about the occurrence of the change in thedetection function of the imaging apparatus 100 from the validity to theinvalidity (polling). In the following description, information updateprocessing 1002 in which the client apparatus 101 according to the thirdembodiment updates the object information will be described withreference to FIGS. 10 and 11. The same or similar functionalconfiguration and processing as or to the first embodiment will beidentified by the same reference numeral, and a redundant descriptionthereof will be omitted as appropriate.

A sequence in which the client apparatus 101 receives the image and themetadata from the imaging apparatus 100 will be described with referenceto FIG. 10. FIG. 10 illustrates the sequence in which the clientapparatus 101 receives the image and the metadata from the imagingapparatus 100.

The control unit 330 of the client apparatus 101 controls thecommunication unit 331 to transmit the command requesting the imagedistribution and the metadata distribution in which the type of theimage is specified so as to receive the image distribution and themetadata distribution from the imaging apparatus 100.

The communication unit 303 of the imaging apparatus 100 receives thecommand requesting the image distribution and the metadata distributionthat is transmitted from the client apparatus 101.

The control unit 301 of the imaging apparatus 100 generates thedistribution image according to the image type specified by the clientapparatus 101 and the metadata by processing for adding the objectinformation to the metadata according to information addition processing1000 based on the received command. The information addition processing1000 according to the present embodiment adds the metadata to the objectinformation if there is a change in the object information transmittedlast time. On the other hand, the metadata lacking the <VideoAnalytics>tag therein is generated and transmitted to the client apparatus 101 ifthere is no change in the object information transmitted last time. Evenwhen the client apparatus 101 receives this metadata, the clientapparatus 101 continues holding this object information in a case wherethere is the object information received in the past. However, if thereis no object information transmitted last time, i.e., the imagingapparatus 100 performs the information addition processing 1000 for thefirst time after receiving the request for the image distribution andthe metadata distribution from the client apparatus 101, the objectinformation is added to the metadata.

The control unit 301 of the imaging apparatus 100 controls thecommunication unit 303 to transmit the generated image and metadata tothe client apparatus 101. Then, the communication unit 331 of the clientapparatus 101 receives the transmitted image and metadata.

After the communication unit 331 receives the transmitted image andmetadata, the control unit 330 controls the communication unit 331 totransmit a command requesting acquisition of the information indicatingwhether the detection function of the detection unit 302 is valid orinvalid with respect to the received image. At this time, the controlunit 330 according to the present embodiment controls the communicationunit 331 to transmit a command requesting the image type on which thevideo analysis is conducted.

The communication unit 303 of the imaging apparatus 100 receives thiscommand, and the control unit 301 acquires the information indicatingwhether the detection function is valid or invalid by acquisitionprocessing 1001. At this time, the control unit 301 acquires the imagetype on which the image analysis is conducted by the acquisitionprocessing 1001 according to the present embodiment. In this case, thecontrol unit 301 may be configured to acquire, from the storage unit304, the setting value of the image type on which the video analysis isconducted or may be configured to acquire the image type on which thevideo analysis is conducted based on the installation state of theimaging apparatus 100.

The control unit 301 may be configured to acquire the informationindicating whether or not the imaging apparatus 100 is set to performthe processing for detecting the object by the detection unit 302 in theacquisition processing 1001. In this case, if the imaging apparatus 100is set to perform the processing for detecting the object by thedetection unit 302, the control unit 301 determines that the detectionfunction of the detection unit 302 is valid with respect to thedistribution image. Then, if the imaging apparatus 100 is set not toperform the processing for detecting the object by the detection unit302, the control unit 301 determines that the detection function of thedetection unit 302 is invalid with respect to the distribution image.

The control unit 301 sets the acquired information regarding thedetection function into a response to the command and transmits it fromthe communication unit 303 to the client apparatus 101 as the response.At this time, in the present embodiment, the control unit 301 sets theacquired image type on which the image analysis is conducted into theresponse to the command.

The communication unit 331 of the client apparatus 101 receives thisresponse and performs object information update processing 1002.

A protocol of the command for use in the communication between theclient apparatus 101 and the imaging apparatus 100 illustrated in FIG.10 may be ONVIF or may be a control protocol specific to the monitoringcamera that is prepared by each manufacturer.

Next, the object information update processing 1002 performed by theclient apparatus 101 according to the present embodiment will bedescribed with reference to FIG. 11. FIG. 11 is a flowchart illustratinga flow of the object information update processing 1002 performed by theclient apparatus 101.

The processing according to the flowchart illustrated in FIG. 11 will bedescribed assuming that it is performed by each of the functional blocksillustrated in FIG. 3B that are realized by the CPU 210 executing thecomputer program stored in the HDD 213 of the client apparatus 101.

In step S1101, the control unit 330 acquires the information indicatingwhether the detection function of the detection unit 302 is valid orinvalid with respect to the received image. The control unit 330according to the present embodiment determines that the detectionfunction is valid if the image type of the distribution image receivedby the communication unit 331 and the image type of the image on whichthe detection unit 302 performs the processing for detecting the objectmatch each other, and determines that the detection function is invalidif these image types do not match each other.

In this case, in step S1101, the control unit 330 according to thepresent embodiment acquires the image type of the distribution image andthe image type of the image on which the video analysis is conducted.The control unit 330 acquires the setting value of the image type of thedistribution image from the command requesting the image distribution tothe imaging apparatus 100 in which the type of the image is specified.Further, the control unit 330 acquires the image type of the image onwhich the video analysis is conducted from the response to the commandrequesting the acquisition of the information indicating whether thedetection function of the detection unit 302 is valid or invalid withrespect to the received image.

Then, the control unit 330 according to the present embodimentdetermines whether the detection function of the detection unit 302 isvalid or not with respect to the image that the communication unit 303transmits based on the acquired image type of the distribution image andimage type on which the video analysis is conducted.

In step S1102, the storage unit 333 stores the image ID indicating areception order of the image received by the communication unit 331, andthe information indicating whether the detection function of thedetection unit 302 is valid or invalid that is determined by the controlunit 330 with respect to the image, in association with each other.

In step S1103, the control unit 330 determines whether the detectionfunction of the detection unit 302 is valid or not with respect to theimage received by the communication unit 331. If the detection functionis valid (YES in step S1103), the processing proceeds to step S609. Ifthe detection function is invalid (NO in step S1103), the processingproceeds to step S1104.

Step S609 and step S610 are similar to the first embodiment, andtherefore descriptions thereof will be omitted here.

In step S1104, the control unit 330 determines whether the detectionfunction of the detection unit 302 has been valid or invalid withrespect to the image received last time, based on the image IDindicating the reception order of the received image and the informationindicating whether the detection function is valid or invalid withrespect to the image that are stored in the storage unit 333. At thistime, if the detection function has been valid with respect to the imagereceived by the communication unit 331 last time (YES in step S1104),the processing proceeds to step S611. If the detection function has beeninvalid (NO in step S1104), the processing ends. The processing proceedsto NO in step S1104 and ends if there is no image received last time,i.e., if the information addition processing 1000 is performed for thefirst time after the client apparatus 101 transmits the commandrequesting the image distribution and the metadata distribution to theimaging apparatus 100.

Processing from step S611 to step S613 is similar to the firstembodiment, and therefore a description thereof will be omitted here.

In the above-described manner, the client apparatus 101 according to thethird embodiment acquires the information indicating whether thedetection function of the detection unit 302 conducting the videoanalysis is valid or not with respect to the received image from theimaging apparatus 100. Then, the client apparatus 101 determines whetherthe detection function of the detection unit 302 is valid with respectto the first image received by the communication unit 331 and thedetection function becomes invalid with respect to the second imagereceived after the first image. Then, if the detection function becomesinvalid with respect to the second image, the control unit 330 updatesthe object information with the information indicating that the objectis not detected. By this operation, the client apparatus 101 can beprevented from holding the invalid object information.

According to each of the above-described embodiments, the clientapparatus 101 can be prevented from continuing holding the invalidobject information.

Other Embodiments

Embodiment(s) of the present invention can also be realized by acomputer of a system or apparatus that reads out and executes computerexecutable instructions (e.g., one or more programs) recorded on astorage medium (which may also be referred to more fully as a‘non-transitory computer-readable storage medium’) to perform thefunctions of one or more of the above-described embodiment(s) and/orthat includes one or more circuits (e.g., application specificintegrated circuit (ASIC)) for performing the functions of one or moreof the above-described embodiment(s), and by a method performed by thecomputer of the system or apparatus by, for example, reading out andexecuting the computer executable instructions from the storage mediumto perform the functions of one or more of the above-describedembodiment(s) and/or controlling the one or more circuits to perform thefunctions of one or more of the above-described embodiment(s). Thecomputer may comprise one or more processors (e.g., central processingunit (CPU), micro processing unit (MPU)) and may include a network ofseparate computers or separate processors to read out and execute thecomputer executable instructions. The computer executable instructionsmay be provided to the computer, for example, from a network or thestorage medium. The storage medium may include, for example, one or moreof a hard disk, a random-access memory (RAM), a read only memory (ROM),a storage of distributed computing systems, an optical disk (such as acompact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™),a flash memory device, a memory card, and the like.

While the present invention has been described with reference toexemplary embodiments, it is to be understood that the invention is notlimited to the disclosed exemplary embodiments. The scope of thefollowing claims is to be accorded the broadest interpretation so as toencompass all such modifications and equivalent structures andfunctions.

This application claims the benefit of Japanese Patent Application No.2018-104906, filed May 31, 2018, which is hereby incorporated byreference herein in its entirety.

What is claimed is:
 1. An image processing apparatus comprising: animage acquisition unit configured to acquire a captured image; adetection unit configured to detect an object in the acquired imageacquired by the image acquisition unit; and a transmission unitconfigured to transmit the image and metadata containing objectinformation regarding the object detected from the image by thedetection unit to an information processing apparatus, wherein thetransmission unit (303) is configured to, if a detection function of thedetection unit is valid with respect to a first image that thetransmission unit transmits and the detection function is invalid withrespect to a second image that the transmission unit transmits after thefirst image, transmit invalidity information indicating that the objectis not detected with respect to the second image to the informationprocessing apparatus.
 2. The image processing apparatus according toclaim 1, wherein the transmission unit is configured to, if a type ofthe image on which the processing for detecting the object is performedby the detection unit and a type of the second image that thetransmission unit transmits do not match each other, determine that thedetection function is invalid with respect to the second image andtransmit the invalidity information indicating that the object is notdetected with respect to the second image to the information processingapparatus.
 3. The image processing apparatus according to claim 1,wherein the transmission unit is configured to, if the type of the imageon which the processing for detecting the object is performed by thedetection unit is a fisheye image, and the type of the second image thatthe transmission unit transmits is not the fisheye image, determine thatthe detection function is invalid with respect to the second image thatthe transmission unit transmits and transmit the invalidity informationindicating that the object is not detected with respect to the secondimage to the information processing apparatus.
 4. The image processingapparatus according to claim 1, wherein the transmission unit isconfigured to, if the type of the image on which the processing fordetecting the object is performed by the detection unit is a panoramicimage, and the type of the second image that the transmission unittransmits is not the panoramic image, determine that the detectionfunction is invalid with respect to the second image that thetransmission unit transmits and transmit the invalidity informationindicating that the object is not detected with respect to the secondimage to the information processing apparatus.
 5. The image processingapparatus according to claim 1, wherein the transmission unit isconfigured to, if the image processing apparatus is set not to performthe processing for detecting the object by the detection unit withrespect to the second image that the transmission unit transmits,determine that the detection function is invalid and transmit theinvalidity information indicating that the object is not detected withrespect to the second image to the information processing apparatus. 6.The image processing apparatus according to claim 1, wherein theinvalidity information is metadata.
 7. The imaging apparatus accordingto claim 6, wherein the invalidity information is metadata including a<Frame> tag having no <Object> tag therein.
 8. The imaging apparatusaccording to claim 1, wherein the invalidity information is an eventnotification.
 9. The image processing apparatus according to claim 1,further comprising an imaging device configured to capture the capturedimage.
 10. An information processing apparatus comprising: a receptionunit configured to receive an image from an image processing apparatusand metadata containing object information about an object detected by adetection unit of the image processing apparatus; an acquisition unitconfigured to acquire information indicating whether a detectionfunction of the detection unit is valid or invalid from the imageprocessing apparatus; and a display unit configured to display,superimposed on the image, information based on the object informationcontained in the metadata received by the reception unit, wherein thedisplay unit is configured to control a display so as not to display theinformation based on the object information contained in the metadatareceived by the reception unit if the information acquired by theacquisition unit indicates that the detection function of the detectionunit is valid with respect to a first image received by the receptionunit and indicates that the detection function of the detection unit isinvalid with respect to a second image received by the reception unitafter the first image is received.
 11. An image processing methodcomprising: acquiring a captured image; detecting an object in thecaptured image; and transmitting the image and metadata containingobject information regarding the object detected from the image to aninformation processing apparatus, wherein, if a detection function ofthe detecting is valid with respect to a first image to transmit and thedetection function is invalid with respect to a second image to transmitafter the first image, invalidity information indicating that the objectis not detected with respect to the second image is transmitted to theinformation processing apparatus.
 12. An information processing methodcomprising: receiving an image from an image processing apparatus andmetadata containing object information about an object detected in thecaptured image by the image processing apparatus; acquiring informationindicating whether a detection function of the detecting is valid orinvalid from the image processing apparatus; and presenting a displaywhile superimposing information based on the object informationcontained in the received metadata on the image, wherein, if theacquired information indicates that the detection function of thedetecting is valid with respect to a received first image and indicatesthat the detection function of the detecting is invalid with respect toa second image received after the first image is received, the displayis controlled so as not to display the information based on the objectinformation contained in the received metadata.
 13. A non-transitoryrecording medium readable by a computer, the recording medium storing aprogram for causing the computer to function as an image acquisitionunit configured to acquire a captured image, a detection unit configuredto detect an object in the captured image acquired by the imageacquisition unit, and a transmission unit configured to transmit theimage and metadata containing object information regarding the objectdetected from the image by the detection unit to an informationprocessing apparatus, wherein the transmission unit is configured to, ifa detection function of the detection unit is valid with respect to afirst image that the transmission unit transmits and the detectionfunction is invalid with respect to a second image that the transmissionunit transmits after the first image, transmit invalidity informationindicating that the object is not detected with respect to the secondimage to the information processing apparatus.
 14. A non-transitoryrecording medium readable by a computer, the recording medium storing aprogram for causing the computer to function as a reception unitconfigured to receive an image from an image processing apparatus andmetadata containing object information about an object detected by adetection unit of the image processing apparatus, an acquisition unitconfigured to acquire information indicating whether a detectionfunction of the detection unit is valid or invalid from the imageprocessing apparatus, and a display unit configured to display,superimposed on the image, information based on the object informationcontained in the metadata received by the reception unit, wherein thedisplay unit is configured to control a display so as not to display theinformation based on the object information contained in the metadatareceived by the reception unit if the information acquired by theacquisition unit indicates that the detection function of the detectionunit is valid with respect to a first image received by the receptionunit and indicates that the detection function of the detection unit isinvalid with respect to a second image received by the reception unitafter the first image is received.